Intoxicated Speech Detection using MFCC Feature Extraction and Vector Quantization
نویسندگان
چکیده
This study has been done on a technique which is suitable for tapping the telephonic conversation from a remote location to identify intoxication and consequent impaired brain activity that may cause criminal events e.g. DUI (driving under influence). This technique is time efficient, easy to use, non–invasive for the peoples and affordable for law enforcement personnel, bartenders/servers, court of law, coworkers/supervisors, clinicians, teachers and individuals who need to identify the presence and level of intoxication state in other peoples. The peaks in log Mel Filter Bank are main cues for identifying the sounds of speech. If a person is found drunk and his/her voice shows a great deal of variation, then this study describes an effective unsupervised method for query-by-audio sample speaker retrieval firstly by extracting MFCC features and then VQ (vector quantization) algorithms on the alcoholic audios. This method is also supported by verifying some speech parameters (fundamental frequency, jitter, shimmer). A set of twelve mel-frequency cepstrum coefficients computed every 10ms and which resulted the best performance i.e. 95% recognition with each of 8 speakers. The superior performance of the mel-frequency cepstrum coefficients may be an attributed to the fact that they better represent the perceptually relevant aspects of the short-terms speech spectrum.
منابع مشابه
Automatic Speaker Recognition Using Fuzzy Vector Quantization
Speaker recognition (SR) is a dynamic biometric task. SR is a multidisplinary problem that encompasses many aspects of human speech, including speech recognition, language recognition, and speech accents. This technique makes it possible to use the speaker’s voice to verify his/her identity and provide controlled access to services. The Mel-frequency extraction method is leading approach for sp...
متن کاملSpeaker Dependent Word Recognition Using MFCC and VQ
The paper present effective method for recognition of digit, numbers. Most of speech recognition systems contain two main modules as follow “feature extraction” and “feature matching”. In this project, (MFCC) Mel Frequency Cepstrum coefficient algorithm is used to simulate feature extraction module. Using this algorithm, the Cepstral Coefficients are calculated on Mel frequency scale. VQ (vecto...
متن کاملComparative Study of MFCC And LPC Algorithms for Gujrati Isolated Word Recognition
The study performs feature extraction for isolated word recognition using Mel-Frequency Cepstral Coefficient (MFCC) for Gujarati language. It explains feature extraction methods MFCC and Linear Predictive Coding (LPC) in brief. The paper compares the performances of MFCC and LPC features under Vector Quantization (VQ) method. The dataset comprising of males and females voices were trained and t...
متن کاملAutomatic Speaker Recognition using LPCC and MFCC
A person's voice contains various parameters that convey information such as emotion, gender, attitude, health and identity. This report talks about speaker recognition which deals with the subject of identifying a person based on their unique voiceprint present in their speech data. Pre-processing of the speech signal is performed before voice feature extraction. This process ensures the voice...
متن کاملSpoken Language Identification Using Hybrid Feature Extraction Methods
This paper introduces and motivates the use of hybrid robust feature extraction technique for spoken language identification (LID) sys tem. The speech recognizers use a parametric form of a signal to get the most important distinguishable features of speech signal for recognition task. In this paper Mel-frequency cepstral coefficients (MFCC), Perceptual linear prediction coefficients (PLP) alon...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014